Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
1.
ACM Web Conference 2023 - Companion of the World Wide Web Conference, WWW 2023 ; : 1204-1207, 2023.
Article in English | Scopus | ID: covidwho-20239230

ABSTRACT

Timeline summarization (TLS) is a challenging research task that requires researchers to distill extensive and intricate temporal data into a concise and easily comprehensible representation. This paper proposes a novel approach to timeline summarization using Meaning Representations (AMRs), a graphical representation of the text where the nodes are semantic concepts and the edges denote relationships between concepts. With AMR, sentences with different wordings, but similar semantics, have similar representations. To make use of this feature for timeline summarization, a two-step sentence selection method that leverages features extracted from both AMRs and the text is proposed. First, AMRs are generated for each sentence. Sentences are then filtered out by removing those with no named-entities and keeping the ones with the highest number of named-entities. In the next step, sentences to appear in the timeline are selected based on two scores: Inverse Document Frequency (IDF) of AMR nodes combined with the score obtained by applying a keyword extraction method to the text. Our experimental results on the TLS-Covid19 test collection demonstrate the potential of the proposed approach. © 2023 ACM.

2.
43rd European Conference on Information Retrieval Research, ECIR 2021 ; 12656 LNCS:497-512, 2021.
Article in English | Scopus | ID: covidwho-1265431

ABSTRACT

The rise of social media and the explosion of digital news in the web sphere have created new challenges to extract knowledge and make sense of published information. Automated timeline generation appears in this context as a promising answer to help users dealing with this information overload problem. Formally, Timeline Summarization (TLS) can be defined as a subtask of Multi-Document Summarization (MDS) conceived to highlight the most important information during the development of a story over time by summarizing long-lasting events in a timely ordered fashion. As opposed to traditional MDS, TLS has a limited number of publicly available datasets. In this paper, we propose TLS-Covid19 dataset, a novel corpus for the Portuguese and English languages. Our aim is to provide a new, larger and multi-lingual TLS annotated dataset that could foster timeline summarization evaluation research and, at the same time, enable the study of news coverage about the COVID-19 pandemic. TLS-Covid19 consists of 178 curated topics related to the COVID-19 outbreak, with associated news articles covering almost the entire year of 2020 and their respective reference timelines as gold-standard. As a final outcome, we conduct an experimental study on the proposed dataset over two extreme baseline methods. All the resources are publicly available at https://github.com/LIAAD/tls-covid19. © 2021, Springer Nature Switzerland AG.

3.
2020 ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL 2020 ; : 459-460, 2020.
Article in English | Scopus | ID: covidwho-916310

ABSTRACT

The new pandemic disease caused by COVID-19 virus is the crucial event over the world in the beginning of 2020. Studies on corona viruses have been however carried since several decades ago, with recent research papers published on weekly basis. We demonstrate a simple approach to explore CORD-19 dataset to provide a high level overview of important semantic changes that occurred over time. Our method aims to support better understanding of large domain-specific collections of scholarly publications that span long time periods and could be regarded as complementary to frequencybased analysis. © 2020. ACM ISBN.

SELECTION OF CITATIONS
SEARCH DETAIL